Specification 

Method for Detecting Mutations in Nucleotide Sequences 

Fj^nfthR Invention 

The present invention relates to a method for detecting mutations 
(nucleotide substitutions) existing in nucleotide sequences by detecting 
mismatched base pairs. The present invention further relates to a method 
for detecting mutations existing in nucleotide sequences which can 
simultaneously detect expression level of genes having the mutations. 

In the medical and biological fields, methods for detecting genetic 
expression levels and mutations thereof are important methods that are very 
frequently used in identification of unknown genes and diagnosis of diseases. 
The methods for detecting expressed genes can be roughly classified into two 
categories One includes methods utilizing visualization of cDNA, and the 
other one includes microarray methods where cDNAs already isolated in 
genome analysis are attached on a substrate, and detection is performed by 
hybridization. 

As for the first group, the cDNA visualization methods, there have 
been reported several method such as the differential display method (Liang, 
P and Pardee, A. Science 257, 967-971), the molecular indexing method 
(Bemner, S. and Livak, K.J. Proc. Natl. Acad. Sci. USA, 86, 8902-8906 
(1989)) and RLCS (restriction landmark cDNA scanning (Suzuki, H. et al. 
Nucleic Acid Res. 24, 289-294 (1996)). Because DNA fragments are 
separated and detected by electrophoresis in these methods, nucleotide 
substitutions on the DNA fragments can be detected during the separation 
by using the SSCP method (Orita, M. et al. K. Genomics 5, 874-879 (1989)), 
DGGE method (Denaturing Gradient Gel Electrophoresis, Myers, R. M. et al. 
Method Enzymol. 155, 501-517 (1987)) and the like. In other words, 
polymorphisms of DNA can be detected, and used for the linkage analysis, 
direct detection of mutations and the like. 

On the other hand, with recent rapid progress of genome science, a 
large amount of cDNA (EST) have been isolated at all stages including 
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developmental stage and from a* tissues, and their nucleotide sequences 
have been determined, though they are partial sequences. Furthermore 
structures of full length genes are recently being determined by usmg full 
length cDNA libraries, and accumulated in the form of data bank and clone 
bank. There has been reported a method called cDNA nucroarray 

_ - t> \t„>i Sri I ISA, Ho. 

expression monitoring (Schena, M. et ai. rruc. Nau. A„_. &cl 
10614-10619 (1996)), which utilizes the large amount of clones obtained from 
such cDNA projects as mentioned above. In this method, expression of 
genes corresponding to the clones mentioned above is detected based on 
hybridization between cDNA fixed on a substrate and a probe comprising 
mRNA obtained from a specimen and labeled through reverse transection. 
This method enables assay for all of the isolated cDNA (EST), and it can 
determine all of the isolated DNA fragments (cDNA (EST)), even ^though .the 
sizes of genome information and genome bank are expanded. In addition, 
because this method does not rely on PCR, signal intensity corresponds to 

genetic expression level. 

Among the methods of the two groups mentioned above, the cDNA 
visualization methods have the following drawbacks. That is, information 
about which signal visualized by these methods correspond to which gene 
has not been sufficiently established, and therefore, when an interesting 
signal is detected, a DNA fragment corresponding to the signal must be 
isolated and collected. Furthermore, their sensitivity, i.e., detectable level 
of expressed genes, depends on the principle of each method. In particular^ 
as for those methods utilizing PCR such as the differential display method 
and the molecular indexing method, the detection of expressed level is biased 
by PCR, and sequences expressed in a small amount may not be detected, 
though they exhibit good sensitivity. Thus, they cannot accurately monitor 
the expressed amounts. On the other hand, in the methods not utihzing 
PGR such as the RLCS method, the expression level is monitored by the 
intensity of signals, but it cannot be considered that all of genes are detected 
as signals, because of their poor sensitivity. 

Further the microarray hybridization method has a drawback that it 
cannot detect small differences of nucleotide sequences such as point 
mutations because it detects signals by hybridization. 

Therefore, the object of the present invention is to provide a method 
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capable of detecting structural mutations in a plurality of genes in paraM 
P articular, a metfiod capable of detecting —1 notations wM 
Jultaneously monitoring their expression levels by ut^n the 
accumulating genome information, and the resource ^f^Z/ZZl 
the monitor of expression level and the ^*^£Z£Z 
(nucleotide substitutions) could be performed simultaneous, y oi — 
the structural mutations (nucleotide substitutes) occur can be ouiddy 
determined by using known genome information. 
" further object of the present invention is to provide reagents and 
apparatuses used for the aforementioned method. 

SSUnI, ^^onrelatest„amethodf„r detecting a nucleic acid 
fragment and/or PNA fragment having a mutation, which comprises 
W)Tstep of hybridizing at least one fragment among one or more fragments 
f^edta substrate, which fragments are seiectedfrom the group consisting 
« or more nucleic acid fragments and one or more PNA fragments, «* 
I Zt one fragment of which mutation is to be assayed, w^gmenU 
selected from the group consisting of one or more nucleic acid fragments and 

^"""reied substance, which is a substance specifically 
££l a mismatched base pair, to a mismatched base pair <™g 

^ISSSStL by the subst.ce hy detecting the 
label (referred to as the "first detection method" hereinafter). 

L present invention further relates to a method for detecting a 
nucleic acid fragment and/or PNA fragment having a mutation, which 

of hybridizing at least one fragment among one or more fragments 
Zl on a sub Late, which fragments are selected from the group costing 
Tol or more nucleic acid fragments and one or more PNA fragments, wUh 
at 1st one fragment of which mutation is to be assayed, 
selected from the group consisting of one or more nuderc acid fragments and 
one or more PNA fragments; 

Z a step of treating a mismatched base pair occurring between the 
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hybridized fragments with a substance specifically recognizing and deaving 
fragments starting from the mismatched base paw 

(EM step of labeUng a fragment remained on the substrate after the 

bindable to a mismatched base pair characterized in that it ^ed^ 

The present invention further relates to an art.de 
that it comprises a substrate on which surface one or more kinds of 
fragments or PNA fragments are fixed in a hybridrzable coupon. 

^^^rrthod for construction of Mut S-GKP fusion 

prote ";r;:iTshorl * - - «* * 

obtained in Example 3. 

Mnili" fe - ™rrvini- out the Inventus 

The present invention will be detailed hereinafter. 

SteP(A B„th of the first and second detection methods of the present 
invent Ize "a substrate on whicfc one or more fragments sdecte from 
the group consisting of one or more nucleic acid fragments » d «™^ 
PNA fragments are fixed". The aforementioned nuderc acid fragments 
Telle DNA fragments and RNA fragments, and the fragments to be fixed 

Err.-—: 

r ♦ „, PNA fragments different in their sequences and/or chain 
fThT ZZ to X be present on the substrate in a combination of 
frag^rind ^A fragments, DNA fragments and PNA fragments, 
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RN A fragments and PNA fragments, or DNA fragments and RNA fragments 
Id Pigments, and in such a case the fragments may be those duterent 
in their sequences and/or chain lengths. nMA fragments 

The fragments to be fixed on the substrate such as DNA fragments, 
RNA fragments and PNA fragment are not pabular! y UmUe so .ong as 
they are molecules having a nucleotide sequence. Tne ta. 6 - " . 

lot plicularly limited, and may be either cDNA, winch rs a genetrc 
JansCl or a genome DNA. It may be a DNA fragment having a part or 

all of full length cDNA of gene. 

An article comprising a substrate on which DNAfragments are fe d 
has been known as, for example, a DNA chip. Microarray 
eDNA are fixed, and methods for measuring expressron levels o 
^Inscription product utilizing them have already been known. The 
"of ^present invention is characterized in that it — 
detects expression levels of transcription products, winch are detected by the 
"ZZ chip, and expression levels of the mismatched base pars, whrch 
"be deteld by the measurement method utilizing the mrcroarray clnp. 
Tl be described hereinafter, information of the both can be obtau«d 
lultaneously by measuring the expression level of — 
with a signal different from the signals for measunng expressron levels 
transcription products. h 

More specifically, the nucleic acid «"* 
substrate may be, for example, full length cDNA, EST (part of cDNA), 

Examples of the genome DNA include plasmrds, phages, PAC, BAC, YAC 

""t aforementioned fun !engtb cDNA EST, genome DNA and the 
hie can be cut out and fixed on the substrate by a known method. In 

they are preferably fixed on the substrate at 
such as 3' or 5' end so that their whole nudeot.de sequences should not be 
fefd This provides an advantage that it can be easily detected because the 
Ty ridized p rtion will be separated away from the subsfrate. Speofica^ 

VZZ. Nu d l Acid Re, 24, 3U,SUS U9S6)), but the means 
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^ Tho aforementioned RNA fragments or riN«. xx 6 
nrp^pnt invention. The aioremenuuu i.^AD^nrS' 

end, and as a result they are nxea in y exam ple, a covalent 

^ * o^iietrate can be obtained by, tor example, 
of the fragments to the substrate can as 

hydrosyl group present on he s » b ^ ™ ^ material md the 

or PNA fragments chemically modified necessary 

size of the substrate is not particularly lunUed, and Aey can 

t^^-^'^^ZZZZ* a plurality o f 
— 5 to the rn^hod h^ pre 

nucleic add fragments and/or PNA fragm ^ ^ 

known are fixed on one substrate, '^ p^ fcagments having different 

„ r m ore Mnds or ^^.^^J^ may also be a 
fragments. The fragment winch is assayed to ^ ^ 

singl e (one) nucleic acid fragment or a -^"^JT „ f 

nucleic acid fragments or fiN a i g —nt, or PNA fragments 

^ ■ a n f nNA fraements, RNA fragments or riNn u. & 
or more kinds of DINA rragmei Wlirt w thev may be 

— in the, sequences and,or cham 1^ «^ ^ 

present in a combination ^ ^ entsandpNAfragmc n K ,orDNA 

fragment - " ^fragments, and in such a case the 
fragment and RNA fi gn« ^ ^ ^ ^ 

^eTa^ n : V -ayed for mutations (probe) may be, but not 
^ted" mZ and cONA, which are genetic — £ — 
genome DNA, RNA and/or PNA transcribed m vitro (m vitro tran 
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For example, when the fragment of which mutations and expression 
level should be detected is a cDNA fragment, mRNA can be isolated from an 
expression tissue at an intended expression stage, and a label can be 
introduced into the first cDNA full length chain by the method for 
synthesizing a first full length cDNA chain. The label of the fragments to 
be hybridized can be fluorescent, phosphorescent, luminescent substances, 
stable isotopes, radioactive substances and the like as described hereinafter. 
It is desirable, however, to use a label substance different from those of the 
substance recognizing mismatched base pairs and a labeling method 
described hereinafter, because expression level and mismatched base pairs 
can be simultaneously detected by doing so. A labeled probe is prepared 
and reacted with a substrate on which nucleic acid is fixed to prepare a 
hybridized molecule. The labeling can be performed by using fluorescent 
materials (rhodamine or fluorescein compounds), or radioactive isotopes such 

as 32 P and 3B S by a known method. 

The hybridization of the fragments fixed on the substrate and the 
fragment to be assayed for mutations can be performed in a conventional 



manner. 



Step (B) 

In the step (B) of the first detection method of the present invention, 
a labeled substance which specifically binds to a mismatched base pair is 
bound to the mismatched base pairs occurring between the hybridized 
fragments produced in the above step (A). 

The fragment to be assayed for mutations is hybridized with a 
fragment having a sequence exhibiting high homology to said fragment 
among the fragments fixed on the substrate. When a base substitution 
(mutation) is present in the fragment which should be assayed for mutations, 
a mismatched base pair is produced between the hybridized fragments. 

In the step (B), a labeled substance which specifically binds to a 
mismatched base pair is bound to the mismatched base pair. Examples of 
the "substance which specifically binds to a mismatched base pair" include, 
for example, mismatch binding proteins. Examples of the mismatch 
binding proteins include, for example, mismatch binding proteins derived 
from microorganisms such as E. coli and yeast, and mismatch binding 
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binds to it (Su, S-b. et ai., r than those 

(1986 ». ^f£zzi*££L»».*~*»>* 

mentioned above include, for example, descri bed in Fleck, O. 

* 4lh *"^ «MW A/C, A/G, Off, G/A and C/A 
has T/C, C/T, T/T, T/-, A/-, C/-, G/, QO. ^ rf ^ 

binding activities in addition to the T/G binding 
C/C mismatch «^ * — ^H-—— 

protein having C/C ^ Vol „, No . 24 , 5289- 

described in Fleck O. et al. Nudeic ^ ^ 

5295 This protein also has T/C, 

totheC :":st— that the binding affinity of 

It has been too patched base pans. For 

proteins may vary dependmg on the ton ^^hes or G/A 

, t x,„ Mut S protein readily detect w* 

w^^^^^^^-^jL, f or C/C 

C/C mismatch binding proW.n ^ ^ J, ^hes can 

The aforementioned substance specifically binding to 

(or eaamp.e, the group consisting 0 ~ * = P 
proteins, fluorescent proteins, luminescent radioact ive 
substances, pbosphorescent substances, rf ^ 

substances, antibodies, antigens, enzymes and proteins. 
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luminescent proteins include GFP (Green Fluorescence Protein) As an 
"of the antibodies, an anti-Mut S antibody can be — d to *e 
Mut S protein. Examples of the antics (antigen tags) nidude »s tag 
thioredoxin tag, HA (hemagglutination) tag, myc tag and the 
thioredoxin tag, an a*ti.thiored„xin antibody can be used and for ant, HA 
and an anti-myc antibodies can be used for HA tag ana myc a*, 
Tor bTotin, (strUavidin can be mentioned as the prCin. = W the 
enzymes include alkaline phosphatase, luciferase, aequorm and the hke. 

^ method for the labeling ca* suitably be selected depends on 
the kind of labels. For example, when the label substance is a 

I!, phosphorescent protein, fluorescent protein, ^ 
L the label substance can be used as a fusion protein. When the abel 
I" a fluorescent substance, phosphorescent substance stilble 
I^pe radioactive substance, antibody, antigen or the like, the label can be 
attached by a known chemical reaction or enzyme reaction 

The luminescent protein, Green Fluorescence Proton fflm-J. 
substance that emits luminescence with consumption of ATP by itseJi 
substance in a „ hs tance While this is a protein having a 

without requiring any auxiliary substance, vv 

relatively large molecular weight of 25 kDa, the gene for Mut S and a codon 
Z be ligated together to produce a fusion pro^, which hen can b 

used for detection of mismatched base pairs based on luminescence. Such a 
protein is very useful for detecting mismatched base pair sites, and its 

use is not limited to the use on microchips. 

The "labeled substances specifically binding to a mismatched base 

pair" fall within the scope of the present invention. 

St6P the step (C) of the first detection method of the present mention 
fragment, bound by the substance having the label are identifie _by 
d^g the label. The method for detecting the label can be suiUbly 
Hd depending on the kind of the label. For example, whence 
a luminescent protein, phosphorescent protem fluores ent proton, 

luminescent substance, fluorescent substance, ^^^Z °[ 

fhmrpscence or phosphorescence is detected by a 
thp like luminescence, fluorescence ui , 
luitawl detector. When the .abel is a stable isotope, radioactive material or 
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the like, radiation dose can be detected by any suitable means. When the 
,abel is an antibody or antigen, detection is performed by usmg 
antibody. When biotin is used, it can be detected by using avidm. When 
the label is an enzyme, a luminescent substance can be produced _ « ^ 
reaction product by selecting a suitable substrate. As the substrate, fo 
example a chemiluminescent substrate having a suitable chenulumines _ 
group Vg.. chlorine-substituted 1,2-dioxetane) can be mentioned for alkaline 
phosphatase, and luciferin for luciferase. 

In the first detection method of the present invention, a fragment 
bound by the substance specifically binding to mismatched base .pairs can be 
identified as a fragment having a mismatched base pair by detecting the 

Iabd ' Further, the first detection method of the present invention enables 
q uantification of a fragment having a mismatched base pair in addition to 
the identification thereof by introducing a label into a nudeic acid and*r 
PNA fragment to be assayed for mutations, and detecting the label of the 
nucleic acid and/or PNA fragment to be assayed for mutations. 

l%e label to be attached to the nucleic acid and/or PNA fragment to 
be assayed for mutations may be at least one selected from, for example, the 
^"listing of luminescent substances, fluorescent substances, 
Phosphorescent substances, stable isotopes, radioactive substanc^ 
antibodies, antigens, enzymes and proteins. Specific examples thereofa e 
substances similar to those mentioned in the explanation of the substance 
specifically binding to mismatched base pairs. f „ omlon , 
Furthermore, the quantification and identification of the fragment 
having a mismatched base pair can be simultaneously performed I by using a 
snbstle that produces a signal different from that produced by the labe 
attached to the substance specifically binding to mismatched base pairs as 
t label introduced into the nudeic acid and/or PNA fragment to e assayed 
for mutations. That is, the ratio of the molecules having a mismatched base 
pair contained in the hybridized fragments can be calculated from the resul 
0{ determination of expression level of the nucleic acid and/or PNA fragment 
to be assayed for mutations and the result, of the identification and 
quantification of the fragment having a mismatched base pan:, that can be 
obtained simultaneously. More specifically, intensity of signal (e.g., light 
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Stance specify binding to . 

nuamatdhed base pax- mo lecules formed on the 

Mismatched base pairs of the nyDrm 

^ wt*d bv using a substance specifically binding xo 
substrate are detected by using * ffMWlt from that of the 

pro be. For example , MutS *la^ In ^ a case , the signal under 

« " : ^" of M«t S, and the signal under an 

n „n-excitation condition is the sign ' fluoresce nce of the label 

excitated condition by a laser is a tota Lof ^thes gn rangeofflie 
„f the probe and the luminescent signal * fcom ^ of 

florescence wavelength of the fluorescent sub ^ '^^^^ 
the visible light of GFP, their intensity ratio can 

intensity of mismatched base P^J^J£ 
A. whole cDNA (transcript) to be assayed, 
expression freouency » "^^^ mvcntion , the expression 

rluTed on the'ame substrate, and therefore point mutations can be 
unit, and therefore it may be an extremelyeffectrve^ ^ 

and plants through linkage analysis. 



Step (D) 
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In the step (D) of the second detection method of the present 
In the step ) ^ hybndl2ed 

1 ZZ^ • — ^* -d deavinS 

fragments is treated witn hybrid ized fragments at the 

the mismatched hase pa* to cleave tta ^hyb ^ ^ 

mismatched base pair, or to remove at least a part 
fragments hybridised from ^^£~ m ^ P ta « pair, 
^stance specificaUy ~ A. the nucleases, 

for example, ^^^^ , md the like can he exemplified. 
^TthTdrino miTa* ^hedbasepairs.thesubstanceusedforU.e 

- rr ■ - « ^ ^ — of .e 

overlook them. Further, as desc depending on 

^match binding proteins for pom* - * ^ ^ ^ 

thekindofnnsmatchedbasep^ ^ ^ ^ 

""^^erefore in the second detection method of the present invention a 

mth a jt'STa base ta the 

7p l^rS^-^- on the substrate after the cleavage or 
step (D), and tne labeling after the cleavage or 

tb. removal are labeled » ^^LS^^ base pairs such as 
*. removal by usmg an = ~ ^ , ^ advantage that even 

«— - - - — 

single base mismatches can be detected. 



Step (E) 



in the step (E) of the second detection method of *. present 
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_r ^ u„ fnr Pxamole an enzyme reaction using a 
«;tPn (E) can be performed by, tor example, * 
^substrate. The enzyme action may »e polymerase reaction, 
reaction, hgation reaction, <T end addition redone. ■ th h£ 
When the 3' end addition reaction is used as the enzyme .eac^o. ^ - 
oTTelgments fixed on the substrate are preferably blocked beforethe 
LtL oTthe substrate, before or after the hybrid or before 
*toZ S the label in order to selectively introduce the label by ft. 3^ 
^reaction into only the fronts which have be*, sub^d * 
cleavage or removal. The above blocking can be performed by, for example 
Educing dideoxynudeotides in. the ends using termina, transferase or 

theUke The label substance of the substrate used for the enzyme reaction 
be for example one kind of substance selected from the group 
may be, for example fluorescen t substances, 

consisting of luminescent g substances> 
phosphorescent substance,, stable ~ p „ „ 

antibodies, antigens, enzymes and proteins. Specific ex P 
substances similar to those mentioned in the explanation of the substance 
specifically binding to mismatched base pairs. 

S,eP(F> in the step (F) of the second detection method of ^resent 
H„ n the labeled fragments are identified by detecting the label. The 

Tf the label, and specific examples thereof are similar to those explained 
the step (C) of the first detection method of the present invention. 

I the second detection method of the present invention , th 
fragments in which mismatched base pairs are produced can be identified by 
detecting the label attached in the step (E). 

Furthermore in the second detection method of the present 
inventil t rtification as well as quantification of the fragment havmg 
"itched base pair can be simultaneously performed by introducing a 
into the nucleic acid and/or PNA fragment » be assayed^ or 
mutations, and detecting the label of the nucleic acid and/or PNA fragme 
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to be assayed for mutations. 

The label for the nucleic acid and/or PNA fragment to be assayed for 
mutations may be, for example, one kind of substance selected from the 
group consisting of luminescent substances, fluorescent substances, 
phosphorescent substances, stable isotopes, radioactive substances, 
antibodies, antigens, enzymes and proteins. Specie examples t~ereo. are 
substances similar to those mentioned in the explanation of the substance 
specifically binding to mismatched base pairs. 

Furthermore, the quantification and identification of the fragment 
having a mismatched base pair can be simultaneously performed by using a 
substance that produces a signal different from that produced by the label 
attached to the fragments in the step (E) as the label introduced mto he 
nucleic acid and/or PNA fragment to be assayed for mutations. That is, the 
ratio of the molecules having mismatched base pairs contained in the 
hybridized fragments can be calculated from the result of determination of 
expression level of the nucleic acid and/or PNA fragment to be assayed for 
mutations and the results of the identification and quantification of the 
fragments having a mismatched base pair, that can be obtained 
simultaneously. More specifically, intensity of signal (e.g., light emission 
intensity) obtained from the label introduced into the nucleic acid and/or 
PNA fragment to be assayed for mutations, and intensity of signal (e.g., light 
emission intensity) obtained from the label attached to the fragments in the 
step (E) can be measured, and compared with prekminarily determined 
calibration curves respectively to determine the expression level of the 
nucleic acid to be assayed for mutations when it is a gene and the ratio of the 
fragments having a mismatched base pair. 

According to the present invention, there can be provided methods 
capable of simultaneously detecting structural mutations of a plurality of 
genes in particular, novel methods capable of detecting structural mutations 
while' simultaneously monitoring expression levels of the genes, utilizing 
accumulating genome information, and resources of genomic clones. 
Furthermore, according to the method of the present invention, because lt u 
possible to simultaneously monitor expression levels of the genes and detect 
their structural mutations (base substitution), it can be quickly determined 
in which genes the structural mutations (nucleotide substitutions) occur by 
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using known genome information. 

The present invention also provides a labeled substance specifically 
bindable to mismatched base pairs, and articles comprising a substrate 
having a surface on which RNA fragments or PNA fragments are fixed in a 
hybridizable condition, which are useful for the aforementioned method. 

The method of the present invention has the advantage that 
information of expression frequency and information of mismatched base 
pairs due to polymorphism or mutations can be obtained on the same 
su.bstra.te 

Moreover, by utilizing the method of the present invention, linkage 
analysis can be performed for family analysis of organisms by the detection 
of mismatched base pairs (polymorphisms) present in a plurality of genes 
derived from a plurality of individuals. It also enables searching of a gene 
sequence responsible for a phenotype by identifying a gene sequence having 
the largest number of mismatched base pairs through detection of 
mismatched base pairs for a mixture of a plurality of genes derived from a 
plurality of individuals commonly exhibiting the same phenotype. 

Examples 

The present invention will be explained more in detail with reference 
to the following examples. 

Example 1 

DNA to be fixed was prepared from TSH/3 plasmid (Hayashizaki, Y. 
et al. FEBS Lett. 188 (1985) 394-400). That is, 10 /zg of a plasmid vector, 
pBluescriptll having human TSHtfcDNA inserted into the cloning site, SstI, 
NotI, was digested with SstI and NotI, fractionated by agarose gel 
electrophoresis and purified to afford l//g of TSH/? cDNA. 

Amino group was introduced into the 3' end of this TSH/? cDNA as 

follows. 

TSH/? DNA 1 Ug 

2' ) 3"-dideoxy-5-(3-aminopropyn-l- yl)-UTP 0 5 M 

(Final concentration) 

Terminal Deoxynucleotidyl Transferase (TOYOBO, Japan) 50 units 

10 x Reaction buffer (TOYOBO, Japan) 5 V 1 
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A mixture having the above composition in a reaction volume of 50 
u\ was incubated at 37'C for 60 minutes. , „j 

After the reaction, the reaction mixture was treated with phenol, and 
subjected to ethanol precipitation to afford 0.5 « of 3' end aminated TSH 

" ^Then, 0.5 US of the V end aminated TSH 0 cDNA and sucinic 

anhydride at a final concentration of 5% were allowed to react in a volume of 
to form DNA to which 3' end was introduced with carboxy! group. 

This solution was mixed and immediately used for Ugation reaction to a 

substrate explained hereinafter. 

The substrate was made from a slide glass. A slide glass was 
treated with 100% trifluoroacetate at room temperature for one hour, dried, 
and treated by immersion in 2% APTES (aminopropyltriethoxysJane. Kanto 
KagaT) solution (water:acetone - 50:50) at room temperature for one day. 
TheCss was washed three times with acetone and once with acetomtnle to 

afford an aminated substrate. 

TSH/J DNAwiththecarboxylatedS'endwasfixedontheanmated 

.„ . ti,.,;, theTSH/? DNAwithcarboxylated3'endwas 
substrate as follows. Thatis, tne lanp 

mixed with carbodiimide at a final concentration of 5%, and 1 p.\ of the 
^ure was dotted on the aminated substrate. The substrate was then 
Abated at 50'C for 6 hours, and washed with washing solution 1 (10 mM 
Tris-Cl [pH 8.0,, 1 mM EDTA, 0.1% SDS), 0.!N NaOH, and — 
2 (10 mM Tris-Cl IpH 8.01, 1 mM EDTA), successively. Thus, single 
stranded DNA fixed on the substrate was obtained. 



EXamP clstruction of Mut S-GFP fusion protein expression plasmid is 

Sh " m Pi^Mut S gene (Schlensog, V. and Boecx. A., J. Bacterio!. 17, 
7414-7415 (1991)) of E. coli DH5a (Lifetech Oriental, U.S.A) was amphfied 
by FOR. A primer, ttg gta cfc gag atg agt gca ata gaa aat ttc gac^ which had 
modified in sequence around the initiation codon and 
site was used as the upstream primer, and a primer, cga cgt tgt cga cac cag 
^ caa gcg ata aat, which had been modified to have AccI site, was used 
^thedownsLani primer. The amplified DNA fragment was digested with 
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Xhol and AccI, and ligated to pEGFP-Nl (CLONETECH, U.S.AJ to afford 
Mut S-GFP, which « then introduced into DH10B (Lifetech Oriental, 

U S A.), and multiplied. 

The Mut S-GFP collected from DHlOa was digested with NotI and 
Xho, (Nippon Gene. Japan,, and hgated to pThioHisB ^Onvitrogen, 
Netherlands) at the Notl-Xhol site to construct HP-Thro-Mm . 

This was introduced into a host, E. coli TOP10 (Invitrogen 
Netherlands), and cultured at 25'C in LB culture medium. Express.cn of 
the fusion protein was induced by addition of IPTG. After the cultivation^ 
the cells were collected. The cells were disrupted by sonication, and 
centrifuged at 10000 g to obtain a supernatant as a soluble fraction. 

This soluble fraction was analyzed by Western blotting usmg an 
anti-thioredoxin antibody, and it was confirmed that a sufficient amount of 
the fusion protein was expressed. 

The obtained soluble fraction was purified by using a ProBond 
column (Invitrogen, Netherlands) according to the manufacturer's protocol, 
and the purified product was treated with enterokinase to cut out the HP 
thioredoxin portion and the Mut S-GFP portion. These were *> 
SDS electrophoresis, and a band of 130 kDa was excised. The product was 
renatured to obtain Mut S-GFP fusion protein. 



Example^ ^ ^ ^ ^ EMmple j was ^th a solution of 70 

mM succinic anhydride, 0.1 M boric acid, pH 8.0 35% l-methyl-2 f 
pyrrolidinone in order to eliminate non-specific hybridization. 0^1 * g of 
TSHtfdouble-stranded cDNA (Hayashizaki, Y. et al. FEBS Lett. 188 (1985) 
394-400) labeled with «P was dissolved in 11 Ml of 3.5 x SSC contaunng 4 
*g of poly(dA) (Sigma, U.S.A.), 2.5 *g of E. cob tRNA(Sigma, ^ 
g mouse CotI DNA (Lifetech Oriental, U S A), and 0.3 ^1 of 10/. SDS, 
boiled for two minutes, and cooled at room temperature. Hybridization was 
performed in a warm water bath at 62 -C for 14 hours. After the 
hybridization, the chip was washed with 2 x SSC, 0.2% SDS for . 
and with 0.2 x SSC for one minute. This chip was ™* ° C 

the Mut S-GFP fusion protein produced in Example 2 in 0.02 M KP0 4 , pH 7 A 
0 05 M KC1, 0.1 mM EDTA, 1 mM dithiothreitol, and 0.01% BSA by 
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incubation at 37°C for one hour. The chip was washed with a solution 
containing 0.02 M KP0 4 , P H 7.4, 0.05 M KC1, 0.1 mM EDTA, 1 mM 
dithiothreitol, and 0.01% BSA, and observed under a fluorescence microscope 
to determine whether the Mut S-GFP fusion protein had been bound to the 
chip. Then, autoradiography of the DNA chip was obtained to determine 
whether the labeled cDNA had been hybridized. The results are shown in 
Figure 2. 
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